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This paper studies many-server limits for multi-server queues 
that have a phase-type service time distribution and allow for cus- 
tomer abandonment. The first set of limit theorems is for critically 
loaded G/Ph/n+ GI queues, where the patience times are indepen- 
dent and identically distributed following a general distribution. The 
next limit theorem is for overloaded G/Ph/n + M queues, where 
the patience time distribution is restricted to be exponential. We 
prove that a pair of diffusion-scaled total-customer-count and server- 
allocation processes, properly centered, converges in distribution to 
a continuous Markov process as the number of servers n goes to 
infinity. In the overloaded case, the limit is a multi-dimensional dif- 
fusion process, and in the critically loaded case, the limit is a sim- 
ple transformation of a diffusion process. When the queues are crit- 
ically loaded, our diffusion limit generalizes the result by Puhalskii 
and Reiman (2000) for GI /Ph/n queues without customer abandon- 
ment. When the queues are overloaded, the diffusion limit provides a 
refinement to a fluid limit and it generalizes a result by Whitt (2004) 
for M/M/n/ + M queues with an exponential service time distribu- 
tion. The proof techniques employed in this paper are innovative. 
First, a perturbed system is shown to be equivalent to the original 
system. Next, two maps are employed in both fluid and diffusion 
scalings. These maps allow one to prove the limit theorems by ap- 
plying the standard continuous-mapping theorem and the standard 
random-time-change theorem. 
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1. Introduction. This paper studies many-server limits for multi-server 
queues that allow for customer abandonment. These queues are assumed 
to have a phase-type service time distribution. We consider two separate 
parameter regimes: one for critically loaded many-server queues and the 
other for overloaded many-server queues. 

As argued in the seminal paper of Halfin and Whitt (1981), for a crit- 
ically loaded many-server queue, the system provides high-quality service 
and at the same time achieves high server utilization. Thus, the critically 
loaded parameter regime is also known as the Quality and Efficiency-Driven 
(QED) limiting regime or the Halfin-Whitt limiting regime. For the over- 
loaded M/M/n + M model, Whitt (2004) demonstrates that a certain fluid 
approximation can be useful in predicting the steady- state performance of 
the multi-server system. He further demonstrates that a diffusion limit pro- 
vides a refined approximation. 

Our first set of results is for critically loaded G/Ph/n + GI queues, whose 
patience times are independent and identically distributed (i.i.d.) following 
a general distribution. In Theorem 1, we prove that a pair of diffusion-scaled 
total- customer- count and server-allocation processes converges in distribu- 
tion to a continuous Markov process (X,Z). In Theorem 2, we prove that 
the diffusion-scaled customer- count-vector process converges to a diffusion 
process Y. In Theorem 3, the diffusion-scaled virtual waiting time process 
converges in distribution to a constant multiple of (A) + , which serves as 
the limit of the diffusion-scaled queue-length process. Our second result is 
for overloaded G/Ph/n + M queues, whose patience time distribution is re- 
stricted to be exponential. In Theorem 4, we prove that the pair of diffusion- 
scaled total- customer- count and server- allocation processes converges in dis- 
tribution to a diffusion process. Although the limit (X,Z) in Theorem 1 is 
not a diffusion process in a strict sense (see discussions below the state- 
ment of Theorem 2), we still call it a diffusion limit because it is a simple 
transformation of the diffusion process Y in Theorem 2. This terminology 
is consistent with the usage in conventional heavy traffic, where the limit 
process is often a constrained diffusion process [see, e.g., Reiman (1984)]. 

In the critically- loaded regime, Halfin and Whitt (1981) is the first paper 
to establish a diffusion limit for the GI /M/n model. Puhalskii and Reiman 
(2000) establish a diffusion limit for the GI / Ph/n model, where the service 
time distribution is phase-type. Garnett, Mandelbaum and Reiman (2002) 
prove a diffusion limit for the M/M/n-\-M model, which allows for customer 
abandonment. Whitt (2005) generalizes the result to the G/M/n + M model. 
In the same paper, Whitt proves a stochastic-process limit for the G/H^/n 
model; this limit is not a diffusion process although a simple transformation 
of it is a diffusion process. Our first set of results, Theorems 1 and 2, extends 
the result of Puhalskii and Reiman (2000) to the Gj Ph/n + GI model, which 
allows for customer abandonment with a general patience time distribution. 
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It also extends the work of Garnett, Mandelbaum and Reiman (2002) and 
Whitt (2005) to allow for phase-type service time distributions. For the 
overloaded G/Ph/n + M model, Theorem 4 generalizes Whitt (2005) to the 
G/Ph/n + M model. The diffusion limit in the theorem provides a refinement 
to a fluid limit. 

In addition to these limit theorems, the techniques used in the proofs 
are innovative. When the patience time distribution is exponential, we first 
establish a sample-path representation for our G/Ph/n + M model. This 
representation allows us to obtain the total-customer-count and the server- 
allocation processes as a map of primitive processes with a random time 
change. These primitive processes are either assumed or known to satisfy 
functional central limit theorems (FCLTs). Therefore, our limits follow from 
the standard continuous-mapping theorem and the standard random-time- 
change theorem [see, e.g., Ethier and Kurtz (1986) and Billingsley (1999)]. 
When the queues are critically loaded, a result of Dai and He (2010) proves 
that the performance of these queues is not sensitive to the distribution of 
patience times, thus allowing us to prove Theorems 1-3 for general patience 
time distributions with a mild regularity condition. 

Halfm and Whitt (1981), Garnett, Mandelbaum and Reiman (2002) and 
Whitt (2004) all use Stone's theorem to prove diffusion limit theorems in the 
critically-loaded regime. Stone (1963) is set up for convergence of Markov 
chains to a diffusion process. This setting makes the generalization to non- 
renewal arrival processes difficult. Puhalskii and Reiman (2000) also use 
the continuous-mapping approach for the Gl/Ph/n model. They employ a 
different sample-path representation for the total-customer-count process. 
Their representation requires them to use extensively martingale FCLTs in 
their proofs, whereas our approach uses standard FCLTs for random walks 
and Poisson processes. Pang, Talreja and Whitt (2007) review a number of 
sample-path representations and martingale proofs for many-server heavy 
traffic limits, and Whitt (2007) surveys the proof techniques for establish- 
ing martingale FCLTs. Our proofs show that for multi-server queues with a 
phase-type service time distribution and an exponential patience time dis- 
tribution, there is a general approach to proving limit theorems, without 
employing martingale FCLTs. Note that when the patience time distribu- 
tion is general, our proofs for the diffusion limits in the critically-loaded 
regime rely on a result of Dai and He (2010), which is proved by using a 
martingale FCLT. 

Our sample-path representation is based on the equivalence of our multi- 
server system to a perturbed system as illustrated in Tezcan (2006). This 
representation has been used in Dai and Tezcan (2005), Tezcan and Dai 
(2010) and Dai and Tezcan (2008) for multi-server-pool systems when ser- 
vice and patience times have exponential distributions. The sample-path ar- 
gument has been explored previously in the setting of Markovian networks 
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in Mandelbaum, Massey and Reiman (1998) for strong approximations and 
Mandelbaum and Pats (1998) for general state-dependent networks. 

In our continuous-mapping approach, we have heavily exploited some 
maps from H> K+1 to H> K+1 , where K is the number of phases in the ser- 
vice time distribution. Variants of these maps have been employed in the 
literature [see, e.g., Mandelbaum, Massey and Reiman (1998), Reed (2009), 
Tezcan and Dai (2010), Dai and Tezcan (2008) and Pang, Talreja and Whitt 
(2007)]. We use these maps not just in diffusion scaling but also in fluid scal- 
ing. Using a map twice, one for each scaling, allows us to obtain diffusion 
limits as a simple consequence of the standard continuous-mapping theorem 
and the random-time-change theorem. In the seminal paper, Reiman (1984) 
proves a conventional heavy traffic limit theorem for generalized Jackson 
networks. Our approach resembles the work of Johnson (1983), which also 
uses a multi-dimensional Skorohod map twice and provides a significant 
simplification of Reiman's original proof. 

For the G/GI/n model, Reed (2009) proves a many-server limit for the 
total-customer-count process in the critically-loaded regime; his assumption 
on the service time distribution is completely general, and his limit is not a 
Markov process. This work is generalized in Mandelbaum and Momcilovic 
(2009) to allow for customer abandonment. For the overloaded G/GI/n 
model, Puhalskii and Reed (2010) prove a finite-dimensional-distribution 
limit for the total-customer-count process. Jelenkovic, Mandelbaum and 
Momcilovic (2004) prove a limit theorem for the GI/D/n model. Gamarnik 
and Momcilovic (2008) study a many-server limit of the steady-state dis- 
tribution of the GI / GI j n model, where the service times are lattice- valued 
with a finite support. When the service time distribution is general, measure- 
valued processes have been used to give a Markovian description of the sys- 
tem. Kaspi and Ramanan (2010) obtain a measure-valued fluid limit for 
the G/GI/n model. Kang and Ramanan (2010) obtain a measure-valued 
fluid limit for the G/GI/n + G model with customer abandonment, and 
Zhang (2009) obtains a similar measure-valued fluid limit independently. 
Their work partially justifies the fluid model in Whitt (2006). 

The remainder of the paper is organized as follows. In Section 2, we intro- 
duce the Gj GI /n+ GI model, in an asymptotic framework, and phase-type 
distributions. The main results, Theorems 1-4, are stated in Section 3; a 
roadmap for the proofs is introduced in Section 3.3. In Section 4 we introduce 
a perturbed system that is equivalent to a Gf Ph/n + M queue and derive 
the dynamical equations that the perturbed system must obey. The proofs 
for the diffusion limits of G/ Ph/n + M queues, in both the critically- loaded 
and the overloaded regimes, are given in Section 5. Section 6 is dedicated 
to the proof for the diffusion limit of G/ Ph/n + GI queues in the critically- 
loaded regime. In Appendix A, we introduce a continuous map and establish 
various properties for the map. The state-space-collapse lemma is proved in 
Appendix B, in which Theorem 3 and Lemma 3 are also proved. 
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Notation. All random variables and processes are defined on a common 
probability space (f2,.F, P) unless otherwise specified. The symbols Z, Z + , 
N, R and M + are used to denote the sets of integers, nonnegative integers, 
positive integers, real numbers and nonnegative real numbers, respectively. 
For deN, R rf denotes the d-dimensional Euclidean space; thus, R = R 1 . The 
space of functions / : R+ — > R rf that are right-continuous on [0, oo) and have 
left limits in (0, oo) is denoted by B(R + ,R rf ) or simply B rf ; similarly, with 
T > 0, the space of functions /: [0, T] — > R rf that are right-continuous on 
[0,T) and have left limits in (0,T] is denoted by B([0,T],R d ). For / G B d , 
f(t—) denotes its left limit at t > 0. For a sequence of random elements 
{X n ,n G N} taking values in a metric space, we write X n =4> X to denote 
the convergence of X n to X in distribution. Each stochastic process whose 
sample paths are in 3 d is considered to be a B rf -valued random element. 
The space B d is assumed to be endowed with the Skorohod Ji-topology 
[see Ethier and Kurtz (1986) or Billingsley (1999)]. Given we set 

x + = max{x, 0}, x~ = max{— x, 0} and L^cJ = max{j G Z : j < x}. All vectors 
are envisioned as column vectors. For a .fT-dimensional vector u G R , we 
use Uk to denote its feth entry and diag(u) for the K x K diagonal matrix 
with Arth diagonal entry u^; we put \u\ = maxi<£,<^-|u/%| . For a matrix M, 
M' denotes its transpose and Mjk denotes its (j, k)th entry. We reserve / for 
the K x K identity matrix, e for the iT-dimensional vector of ones and e k 
for the .ff-dimensional vector with kth entry one and all other entries zero. 

2. A G/Ph/n + GI queue. We first introduce a Gj GI/n+ GI queue in 
Section 2.1. We then define a G/Ph/n+ GI queue by restricting the service 
time distribution to be phase-type. Phase-type distributions are defined in 
Section 2.2. 

2.1. A G j GI /n + GI queue. A G/GI/n queue is a classical stochastic 
system that has been extensively studied in the literature [see, e.g., Kiefer 
and Wolfowitz (1955), Borovkov (1967) and Iglehart and Whitt (1970), 
among others]. In such a system, there are n identical servers. The service 
times {v{, i G N} are a sequence of i.i.d. random variables, where Vi is the ser- 
vice time of the ith customer entering service after time 0. The service time 
distribution is general (the GI in the G/GI/n notation), although for the 
rest of this paper we restrict it to be a phase-type distribution. The arrival 
process E = {E(t),t > 0} is assumed to be general (the first G), where E(t) 
denotes the number of customer arrivals to the system by time t. Upon his 
arrival to the system, a customer gets into service immediately if there is an 
idle server; otherwise, he waits in a waiting buffer that holds a first-in-first- 
out (FIFO) queue. The buffer size is assumed to be infinite. When a server 
finishes a service, the server removes the leading customer from the waiting 
buffer and starts to serve the customer; when the queue is empty, the server 
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begins to idle. In our model, each customer has a patience time: when a 
customer's waiting time in queue exceeds his patience time, the customer 
abandons the system without any service. Retrial is not modeled in this pa- 
per. We assume that the patience times of customers who arrive after time 0, 
form a sequence of i.i.d. random variables that have a general distribution. 
Thus our model is a G/ GI jn + GI queue, where +GI signifies the general 
patience time distribution. When the patience times are i.i.d. following an 
exponential distribution, the resulting system is a G/GI/n + M queue. 

We focus on systems when the arrival rate is high. Specifically, we consider 
a sequence of Gj GI/n + GI systems indexed by n, with E n being the arrival 
process of the nth system. We assume that for the nth system, the arrival 
rate A n — > oo as the number of servers n — > oo , while the service time and 
the patience time distributions do not change with n. We further assume 
that 

A n — / A n \ 

(2.1) lim — = A>0, limVnlA = /3/i for some /3sR 

and 

(2.2) E n ^E asn^oo, 
where E is a Brownian motion and 

(2.3) E n (t) = -^E n (t), E n (t) = E n (t)-X n t fort>0. 

Jn 



We use m to denote the mean service time; thus u = 1 /m is the mean service 
rate. For future purposes, let 

(2.4) p n = — and p=-. 

np u 

Because customer abandonment is allowed, it is not necessary to assume 
p n < 1 or p< 1. Condition (2.1) implies that 

lim Jn(p - p n ) = p. 

n— ¥oo 

When p = l, the sequence of systems is critically loaded in the limit, and is 
said to be in the Quality and Efficiency-Driven (QED) regime or the Halfin- 
Whitt regime. When p > 1, the sequence of systems is overloaded, and is 
said to be in the Efficiency-Driven (ED) regime. Our focus is both the QED 
and the ED regimes. 



2.2. Phase-type distributions. In this section, we introduce phase-type 
distributions. Such a distribution is assumed to have K > 1 phases. The set 
of phases is assumed to be JC = {1,. . . ,K}. Each phase-type distribution 
has a set of parameters p, v and P, where p is a X-dimensional vector of 
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nonnegative entries whose sum is equal to one, v is a i^-dimensional vector 
of positive entries and P is a K x K sub-stochastic matrix. We assume that 
the diagonals of P are zero, namely, 

(2.5) Pu = for i = l,...,K, 
and P is transient, namely, 

(2.6) I — P is invertible. 

A (continuous) phase-type random variable v is defined as the time until 
absorption in an absorbing state of a continuous-time Markov chain. With 
p, v and P, the continuous-time Markov chain can be described as follows. 
It has K + 1 states, 1, . . . ,K,K + 1, with state K + 1 being absorbing. The 
rate matrix (or generator) of the Markov chain is 



G 



F h 




where F = diag(i^)(P — /) is a K x K matrix and h = —Fe is a X-dimensional 
vector. 



Definition 1. A continuous phase-type random variable v with param- 
eters p, v and P, denoted as v ~ Ph(p,v, P), is defined to be the first time 
until the continuous-time Markov chain with initial distribution p and rate 
matrix G reaches state K + 1 . 



Given condition (2.5), the rate matrix G and (v,P) are uniquely deter- 
mined from each other. It is well known [see, e.g., Latouche and Ramaswami 
(1999)] that 

F[v < x] = 1 — p' exp(i ? x)e for x > 0. 

Because parameters p, v and P uniquely determine a phase-type distribu- 
tion, we free symbols F and G so that they can be reused in the rest of the 
paper. 

For our purposes, we provide an alternative way to sample a Ph(p,v,P) 
random variable. We first sample a sequence of phases k%,...,kL in K, = 
{1,...,K} as follows. We sample phase k\ following distribution p on fC. 
Assume ki,...,ki G 1C have been sampled. Setting j = ki, sample a phase 
from {1, . . . ,K,K + 1} following a distribution that is determined by the 
jth row of P; the probability of getting phase K + 1 is 1 — X^=i Pjl — 0- 
The resulting phase is denoted by ki+\. When ki+i = K + 1, terminate the 
process and set L = i; otherwise, continue the sampling process. Because 
the matrix P is assumed to be transient, one has L < oo almost surely. 
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Let £1, . . . ,£l be independently sampled from exponential distributions with 
respective rates , . . . , Uk L ■ Then 

L 

(2.7) « = J>- 

i=l 

3. Main results. In this section, we present two sets of results. The first 
set, presented in Section 3.1, is for critically loaded G/Ph/n + GI queues. 
The second set, presented in Section 3.2, is for overloaded G/Ph/n + M 
queues. A roadmap for proving these results is given in Section 3.3. 

We consider a sequence of G/Ph/n+ GI queues, indexed by the number of 
servers n, which satisfies condition (2.1). We assume that the service times 
follow a phase-type distribution Ph(p,i/, P). Each customer's service time 
can be decomposed into a number of phases as in (2.7). When a customer is 
in service, it must be in one of the K phases of service. Let Z^(t) denote the 
number of customers in phase k service in the nth system at time t; service 
times in phase k are exponentially distributed with rate v\.. We use Z n (t) to 
denote the corresponding i^-dimensional vector. We call Z n = {Z n (t),t > 0} 
the server- allocation process. Let N n (t) denote the number of customers in 
the nth system at time t, either in queue or in service. Setting 

(3.1) X n (t) = N n (t) -n fort>0, 

we call X n = {X n (t),t > 0} the total- customer- count process in the nth 
system. One can check that (X n (t)) + is the number of customers waiting 
in queue at time t, and (X n (t))~ is the number of idle servers at time t. 
Clearly, 

(3.2) e'Z n {t) = n-(X n {t))- for t > 0. 

The processes X n and Z n describe the "state" of the system as time evolves. 
Hereafter, they are called the state processes for the nth system. 

The customers in service are distributed among the K phases following a 
distribution 7, given by 

(3.3) 7 = VR^P, 

(3.4) R=(I - P')diag(u). 

One can check that X]fc=i7fc = 1; an d one interprets 7^ to be the fraction of 
phase k load on the n servers. 

The preceding paragraph suggests the following centering for the server- 
allocation process: 

Z n (t) = Z n (t)-nj fort>0. 
Define the corresponding diffusion-scaled process 

Z n (t) = ~^Z n {t) fort>0. 
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3.1. Diffusion limits for critically loaded Gj Ph/n + G I queues. Through- 
out Section 3.1, we assume that p = 1 and that the patience times of cus- 
tomers who arrive after time are i.i.d. having distribution function F, 
which satisfies 

(3.5) F(0)=0 and a = limx~ 1 F(x) < oo. 

Note that for exponentially distributed patience times, a turns out to be 
the rate of the exponential distribution. 

Since p is assumed to be 1, we define the diffusion-scaled total-customer- 
count process X n by 

(3.6) X n {t) = ^=X n {t) fort>0. 

In 



[When p > 1, the definition of X n (t) will be modified, which is given in 
(3.27).] We assume that 

(3.7) (X n (0),Z n (0)) (X{0),Z(0)) asn^oo 

for a pair of random variables (X(0), Z(0)). 

The random variables X(0) and Z(0) are assumed to be defined on some 
probability space (fi, J 7 , P), which is rich enough so that stochastic processes 
E, . . . , $ K and S defined on this space are independent of (X(0),Z(0)). 
Here, E is a one-dimensional driftless Brownian motion, and $°,...,$> K 
and S are iC-dimensional driftless Brownian motions. These Brownian mo- 
tions, possibly degenerate, are mutually independent and start from 0; the 
variance of E is \(? a for some constant c\ > 0, and the covariance matrices 
of <&°,...,<& K and S are H°,...,H and diag(f), respectively, where for 
k = 0, . . . , K , the K x K matrix H k is given by 



^ \ —v k iV% otherwise 



Pi(l-Pj), if* = i, 

iPj: 

with p° =p and p k being the kth column of P' . 
To state the main theorems of this paper, let 

(3.9) U(t)=X(0) + E(t) - p(3t + e'M(t), 

(3.10) V(t) = (I- pe')Z(0) + + (J - pe')M(t), 
where 

K 

(3.11) M(t) = ^ d fc K7fc*) " U " i")5(7*) 

fc=i 

for i > 0. The process (U, V) is a (ET + l)-dimensional Brownian motion; 
it is degenerate because e'V(t) = for t > 0. [When p > 1, the definition 
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of U will be modified in (3.28).] Before we state the first theorem of this 
paper, we present the following lemma, which is a corollary of Lemma 9 in 
Appendix A. 

Lemma 1. Let p be a K -dimensional vector that is the distribution of 
initial phases of the phase-type service times, R be the K x K matrix defined 
by (34) and a > be defined by (3.5). (a) For each (u,v) G O k+1 with 
u(t) G R and v(t) G M A for t>0, there exists a unique (x,z) G O k+1 with 
x(t) G R and z(t) G R K for t > 0, such that 

(3.12) x(t) = u{t)-a I (x(s)) + ds - e'R [ z(s)ds, 



Jo 



(3.13) z(t) = v(t) - p(x(t))~ — {I — pe')R f z{s)ds 

Jo 

for t>0. (b) For each (u,v) G B K+1 , define $(u,v) = (x,z) G B K+1 , where 
(x, z) satisfies (3.12) and (3.13). The map $ is well defined and is continuous 
when both the domain and the range D^ +1 are endowed with the Skorohod 
J\-topology. (c) The map $ is Lipschitz continuous in the sense that for any 
T > 0, there exists a constant C\ > such that 

sup |$(y)(t)-*(jQ(t)|<Cj. sup \y(t)-y(t)\ 

0<t<T 0<t<T 

for any y,y G D . 
(d) The map $ is positively homogeneous in the sense that 
(3.15) $(ay) = a$(y) for each a > and each y G D A ' +1 . 

Let Aq be the number of customers who are waiting in queue at time 
but will eventually abandon the system, and 

1 

A n — —— A n 
A - A . 



To state Theorem 1, we assume that 

(3.16) lg=>0 asn^oo. 

Clearly, condition (3.16) is satisfied if no customers are waiting in queue at 
time 0. The validity of this initial condition will be further discussed at the 
end of Section 3.1. 

Theorem 1. Consider a sequence of G/Ph/n + GI queues satisfying 
(2.1) and (2.2). Assume that p = l and that (3.5), (3.7) and (3.16) hold. 
Then 

(X n ,Z n )^(X,Z) as 00, 
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where 

(3.17) (X,Z) = *(U,V). 

Suppose that each customer, including those initial customers who are 
waiting in queue at time 0, samples his first service phase that he is yet 
to enter following distribution p at his arrival time to the system. One can 
stratify the customers in the waiting buffer according to their first service 
phases. For k = 1,...,K, we use Q k (t) to denote the number of waiting 
customers at time t whose initial service phase will be phase k [Q k (t) = 
for t > if phase k is not a first service phase for any customer] , and we use 
YP(t) to denote the number of phase k customers in the system at time t, 
either waiting or in service. Let Q n (t) and Y n (t) denote the corresponding 
fT-dimensional vectors. Set 

Ql(t) = ±=Ql{t), Y k n (t) = ^=Y k n (t), 

\Jn yjn 

(3.18) 

Y k n {t) = Y k n (t)-n lk for t>0. 

Clearly, 

(3.19) Y k n {t) = Ql{t) + Zl{t) and X n (t) = e'Y n {t) for t > 0. 

The following lemma says that for critically loaded systems, the waiting 
customers are distributed among the K phases following distribution p. It 
is known as the state-space-collapse (SSC) result. 

Lemma 2. Under the conditions of Theorem 1, for any T > 0, 

(3.20) -= sup \Q n (t) -p(X n (t)) + \ ^0 asn^oo. 

V n 0<t<T 

The following theorem is a corollary to Theorem 1 and Lemma 2. When 
there is no customer abandonment and the arrival process is renewal, it is 
identical to Theorem 2.3 of Puhalskii and Reiman (2000). 

Theorem 2. Under the conditions of Theorem 1, 

(X n ,Y n ,Z n )^(X,Y,Z) asrwoo, 
where (X,Z) is defined in (3.17) and 

(3.21) Y(t)=p(X(t)) + + Z(t). 
The process Y satisfies 

Y(t) = y (0) - Pfipt + $°(/xt) + P E(t) + M(t) 

-rI Y(s)ds + {R-aI)p [ (e'Y(s)) + ds fort>0. 
Jo Jo 
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The process Y in Theorem 2 is a diffusion process (see Rogers and Williams 
[(2000), page 110] or Karlin and Taylor [(1981), page 159] for a definition 
of diffusion processes). Therefore, Y is a continuous Markov process. The 
map in (3.17) defines (X,Z) as a (K + 1) -dimensional continuous process, 
which is degenerate because it lives on a if-dimensional manifold. From the 
-fC-dimensional process Y, one can recover the (K + l)-dimensional process 
(X,Z) via 

(3.22) X(t) = e'Y{t) and Z{t) = Y{t) - p{X{t)) + for t > 0. 

Therefore, (X,Z) is also a continuous Markov process. However, the pro- 
cess {X, Z) is not a diffusion process by the common definition because the 
function x + in (3.22) is not twice differentiable in x at 0. Whitt [(2005), 
Remark 2.2] makes a similar observation that his limit process is not a dif- 
fusion process, but a simple transformation of his limit process is a diffusion 
process. 

Our next theorem is concerned with the virtual waiting time process W n = 
{W n (t),t > 0}. Here, W n (t) is the potential waiting time of a hypothetical, 
infinitely patient customer who arrives at the queue at time t. When there 
is no customer abandonment and the arrival process is renewal, the theorem 
is implied by Corollary 2.3 and Remark 2.6 of Puhalskii and Reiman (2000). 



Theorem 3. Under the conditions of Theorem 1, 

(X) + 

(3.23) yW 1 => as n — y oo. 



We end this section by the following lemma, which gives a justification for 
imposing initial condition (3.16) in Theorems 1-3. Let Aq(t) be the number 
of customers in the nth system who are waiting in queue at time t, but will 
eventually abandon the system. Clearly, 

Its diffusion-scaled version is given by 

(3.24) A$(t) = ^A n Q (t) fort>0. 

Regarding the process Aq = {An(t),t > 0}, we have the following result. 



Lemma 3. Under the conditions of Theorem 1, 



(3.25) 



Aq as n — > oo. 
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The proof of Lemma 3 is presented in Appendix B. Assume that the queue 
is initially empty. Then condition (3.16) is satisfied at time t = 0. Under an 
additional assumption (3.7), Theorem 1 and Lemma 3 imply that for any 
t>0, 

A£(t)=s>0 and (X n (t),Z n (t)) => (X(t),Z{t)) asn^oo. 

Thus, if we start to observe the system at any fixed time t > 0, initial con- 
ditions (3.7) and (3.16) are indeed satisfied at time t. Condition (3.16) is 
used to prove asymptotic relationship (6.4) in the critically-loaded regime; 
this relationship between the abandonment-count process and the queue- 
length process is the key to extending the diffusion limits for G/Ph/n + M 
queues to the G/Ph/n + GI model with a general patience time distribu- 
tion. Condition (3.16) is necessary for the asymptotic relationship to hold. In 
Mandelbaum and Momcilovic (2009), an initial assumption similar to (3.16) 
is made for the G/GI/n + GI model in the critically- loaded regime. 

3.2. A diffusion limit for overloaded G/Ph/n + M queues. Our next re- 
sult is for overloaded G/Ph/n + M systems, where the patience times of all 
customers, including those waiting in queue at time 0, are assumed to be 
i.i.d. following an exponential distribution. We use a to denote the rate of 
the exponential patience time distribution. Note that this definition of a is 
consistent with the definition in (3.5). Assume that p > 1. Intuitively, when 
n is large, all n servers are 100% busy, and there should be nq customers on 
average waiting in the buffer, where 

(3.26) q=*-J±. 

a 

An intuitive explanation is as follows: A — p is the number of customers 
per unit of time that the system must "delete" in order for the system to 
reach an equilibrium. While in equilibrium, each waiting customer abandons 
the system at rate a, and collectively all q waiting customers abandon the 
system at rate qa customers per unit of time. Thus, one should have qa = 
A — p, which leads to (3.26). Readers are referred to Whitt (2004) for further 
discussion on the derivation of (3.26). 

Now we modify the definition of X n in (3.6) and U in (3.9). Let 

(3.27) X n {t) = -^X n (t), X n (t) = X n (t) - nq, 

Jn 



(3.28) U(t) = X(0) + E(t) - p(3t + e'M(t) - G(qt) 

for t > 0. In (3.28), the process G = {G(t),t > 0} is a one-dimensional drift- 
less Brownian motion starting from 0, which has variance a and is indepen- 
dent of E,$ ,..., $ K and S [recall that E, <6°, . . . , <& K and S are Brownian 
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motions denned in Section 3.1, and M is given by (3.11)]. When p = 1, one 
has q = 0. Thus, definitions in (3.27) and (3.28) are consistent with (3.6) 
and (3.9). Assume that 

(3.29) (X n (0),Z n (0))^(X(0),Z(0)) asra^oo 

for a pair of random variables (X(0), Z(0)). 

Before presenting Theorem 4, we introduce the next lemma, which is also 
a corollary of Lemma 9. 

Lemma 4. Let p be a K -dimensional vector that is the distribution of 
initial phases of the phase-type service times, R be the K x K matrix defined 
by (3.4) and a > be defined by (3.5). (a) For each (u,v) G B A ' +1 with 
u(t) G R and v(t) G R A for t>0, there exists a unique (x,z) G O k+1 with 
x(t) G R and z(t) G R K for t > 0, such that 



(3.30) x{t)=u{t)-al x{s)ds-e'R I 

Jo Jo 

(3.31) z(t) =v(t) - (I -pe')R I z{s)ds 



z(s) ds, 



Jo 

for t>0. (b) For each (n, v) G B K+1 , define $f(u, v) = (x, z) G B K+1 , where 
(x,z) satisfies (3.30) and (3.31). The map ^ is well defined and is con- 
tinuous when both the domain and the range are endowed with the 
Skorohod J\-topology. (c) The map $ is Lipschitz continuous in the sense 
that for any T > 0, there exists a constant C\ > such that 

sup \y(y)(t)-V(y)(t)\<CT sup \y(s)-y(s)\ 

0<t<T 0<t<T 

(3.32) 

for any y,y G B A+1 . 
(d) The map is positively homogeneous in the sense that 

(3.33) ^(ay) = ofl>(y) for each a > and each y G O k+1 . 

Theorem 4. Consider a sequence of G/Ph/n + M queues satisfying 
(2.1) and (2.2). Assume that p>l and that (3.29) holds. Then 

(X n , Z n ) ^ (X , Z) asn^oo, 

where 

(3.34) (X,Z) = V(U,V). 

Equation (3.34) defines (X, Z) as a (K + l)-dimensional diffusion process, 
which is also degenerate and lives on a ET-dimensional manifold. 
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3.3. A roadmap for proofs. Theorems 1 and 4 are the main results of 
this paper. Theorem 4 and a restricted version of Theorem 1 are proved 
in Section 5; the restriction is to assume that the patience times are ex- 
ponentially distributed. These proofs use standard FCLTs and then apply 
the continuous-mapping theorem and the random-time-change theorem. To 
construct appropriate continuous maps, we introduce a perturbed system 
in Section 4.1, which is equal in distribution to the original system when 
the patience time distribution is exponential. Using the perturbed system, 
we are able to construct a set of system equations in Section 4.2, which is 
critical to define the continuous maps. 

Section 6 is devoted to proving the general version of Theorem 1. When 
the patience time distribution is general and the systems are critically loaded, 
we first modify the preceding system equations slightly by replacing the cu- 
mulative number of customer abandonments by an integral of the queue- 
length process. We then apply an asymptotic relationship in Dai and He 
(2010) to establish a result that the error from the replacement is negligible 
under a stochastic boundedness assumption of the diffusion-scaled queue- 
length process; the latter assumption holds by a comparison result in Dai 
and He (2010) and the restricted version of Theorem 1 proved in Section 5. 

Theorem 2 is a corollary to Theorem 1 and Lemma 2; the latter is proved 
in Appendix B. Theorem 3 is also proved in Appendix B. 

4. System representation for a G/Ph/n + M queue. In this section, 
we first describe a perturbed system of the G/Ph/n + M model, and show 
that this perturbed system is equivalent to the G/Ph/n + M queue. We then 
develop the dynamical equations that the perturbed system must satisfy. 

4.1. A perturbed system. Now we describe a perturbation of the Gj ' Ph/n + 
M model. In the perturbed system, each phase has a service queue for the 
customers "in service." Only the leading customer in the service queue is 
actually in service; all others are waiting in the service queue, ordered accord- 
ing to the FIFO discipline. We use ZZ(t) to denote the number of customers 
in phase k service queue at time t. (Star-version quantities are associated 
with the perturbed system; the corresponding quantities in the original sys- 
tem are denoted by the same symbols without the star.) Each customer in 
the service queue is attached to exactly one server. Thus, there are exactly 
Zfc(t) servers in phase k at time t. All these Z%(t) servers simultaneously 
work on the leading customer in the service queue. The service effort received 
by the leading customer is additive, proportional to the number of servers 
working on the customer. Each customer has a service requirement for each 
phase that he visits, with phase k service requirement being exponentially 
distributed with mean l/v^. 
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When the total service effort spent on a customer reaches his service 
requirement in a phase, the service in the phase is completed. When a cus- 
tomer completes a phase k service, he immediately moves to the next phase 
following a sampling procedure to be specified below, taking with him the 
associated server. If the service queue in the new phase is not empty at their 
arrival, the server joins the service immediately, collaborating with other 
servers who are already in service to work on the leading customer in the 
service queue. The newly arrived customer joins the end of the service queue. 
If the new service queue is empty, the server works on her customer who is 
the only one in the new phase of service. 

When a customer finishes a phase k service, it uses the kth row of P to 
sample the next phase of service to join among phases {1, . . . ,K, K + 1}; 
the probability of selecting phase K + 1 is 1 — YleLi Phi- If ^ € {1, ... , K} is 
selected, both the server and the customer move next to phase £. If K + 1 is 
selected, the customer exits the system and the associated server is released. 
The released server checks the FIFO real queue to select the next customer 
to work on if the real queue is not empty. The selected customer is attached 
to the server until the customer exits the system. If the real queue is empty, 
the server becomes idle. 

At a customer's arrival time to the system, if there is an idle server, the 
customer grabs a free server and is attached to the server until the customer 
exits the system. Together, they move into the customer's first phase of 
service, which is selected according to distribution p. The service and waiting 
mechanism is identical to the one described in the previous paragraph. If all 
servers are busy at the customer's arrival time, the customer joins the end 
of the FIFO real queue. Only the leading customer in the FIFO real queue 
can abandon the system; other waiting customers are infinitely patient until 
they become a leading customer. The patience time of the leading customer 
is exponentially distributed with mean 1/a. The customer abandons the 
system without service if his patience clock exceeds the patience time. The 
patience clock starts from when the customer becomes a leading customer 
and increases at rate k when the queue length is k. 

For each n fixed, now we show that when the arrival process E n is a 
renewal process, the perturbed system and the original system are equivalent 
in a precise mathematical sense. For that, recall that the waiting buffer in 
the perturbed system maintains a FIFO queue for waiting customers. Let 

Q*(t) = (*!,...,*£.(*)), 

where L*(t) is the total number of customers waiting in queue at time t, 
and ig is the first service phase that the ith. customer is yet to enter. 

Let £(t) be the remaining interarrival time at time t. (£ has no star because 
the arrival processes in the perturbed system and in the original system are 
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identical.) It follows that, {(£(i), Q*(t),Z*(t)),t > 0} is a continuous-time 
Markov process living in state space R_|_ x /C°° x Z£ , where /C°° is the space 
of finite sequences taking values in /C = {1, . . . , K}. 

Let {(£(i), Q(t), Z(t)),t > 0} be the corresponding process of the original 
system. The process is also a continuous-time Markov process. At any time 
t, the phase k service rate is Z^{t)vk in the perturbed system and Zk{t)vk in 
the original system, while the abandonment rate is L*(t)a in the perturbed 
system and L(t)a in the original system. One can check that the two Markov 
processes 

{(£(*), Q(t),Z(t)),t>0} and {({;(t),Q*(t\Z*(t)),t>0} 

have the same generator. Thus, when they have the same initial distribution, 
they have the same distribution for the entire processes. In the following, we 
always choose the initial condition of the perturbed system to be identical 
to that of the original system. 

Even if the arrival process is not a renewal process, the perturbed system 
can still have the same distribution as the original system. The rest of the 
paper does not require the arrival process to be renewal. Rather, we assume 
that each arrival process satisfies the requirement that the perturbed system 
has the same distribution as the original one. See Tezcan (2006) for a more 
general treatment of perturbed systems. 

4.2. System equations. From now on, we focus on the perturbed system 
of the G/Ph/n + M queue and drop the stars attached to its quantities. 
We assume that the patience times of all customers, including those who 
are waiting in queue at time 0, are exponentially distributed with rate a. In 
this section, we describe the dynamical equations that the system must obey. 
For k = l,...,K, let <j> k = {<p k (j),j E N} be a sequence of i.i.d. "Bernoulli 
random vectors." For each j, the /^-dimensional random vector 4> k (j) takes 
vector e e with probability P^g and takes the /f-dimensional zero vector with 
probability 1 - Y%=\ P M- Similarly, let (p° = {<f>°(j),j E N} be a sequence of 
i.i.d. i^-dimensional random vectors; the probability that = e^ is pg. 

For k = 0, . . . , K, define the routing process 

TV 

$ fe (jV) = J2<t> k {j) foriVeN. 

3=1 

For each k = 1, . . . ,K, let be a Poisson process with rate v^, and let G 
be a Poisson process with rate a. We assume that 

(4.1) X n (0), E n , Si, ...,S K ,<f> ,...,$ K and G are mutually independent. 

Let (t) be the cumulative amount of service effort received by customers 
in phase k service in (0,i], B n (t) be the cumulative number of customers 
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who have entered service in (0,i] and D n {t) be the cumulative number of 
customers who have completed service in (0, t] . Clearly, 

(4.2) T£(t)= f Z%(s)ds fort>0. 

J o 

Then Sk{T^{t)) is the cumulative number of phase k service completions by 
time t. Also G( J Q * (X n (s)) + ds) is the cumulative number of customers who 
have abandoned the system by time t. One can check that for t > 0, the 
processes X n and Z n satisfy the following dynamical equations: 

K 

(4.3) Z n {t) = Z n {0) + $°(B n (t)) + $ k (Sk{T£(t))) - S{T n {t)), 

k=l 

(4.4) X n {t) = X n {0) + E n (t) - D n {t) -G^j\x n (s)) + ds\ , 

K 

D n (t) = 2> fc (Zy(*)) " e'<S> k (S k (TZ(t)))) 



(4.5) 



where 



k=l 

/ k 



-e'[^ k (S k (T^t)))-S(T n (t))\, 



\k=l 



S(T n (t)) = (S 1 (T 1 n it)),...,S K (TZ(t))) > - 
4.3. State-process representation. Define the centered processes 

N 

S(t) = S(t) - ut, G(t) = G(t) - at, ¥(N) = -p e ) 

i=i 

for t > 0, t = 0, . . . , K and iV6N, where p° =p and p k is the kth column of 
P' for k = l,...,K. Setting 

(4.6) M n (t) = $ h (Sk(mt))) -(I- P')S(T n (t)), 

k=l 

one then has 

Y j <S> k {S k {T%{t)))-S{T n {t)) = M n {t)-R / Z n (s)ds, 
k=i Jo 
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where R is defined in (3.4). By (4.3) and (4.5), 

(4.7) e'Z n (t) = e'Z n (0) + B n {t) - D n {t), 

(4.8) D n (t) = -e'M n (t) + e'R f Z n {s) ds. 

Jo 

It follows from (3.2) and (4.3)-(4.8) that 

Z n (t) = Z n (0) +p{X n {0)y + $°(B n (t)) -p{X n {t))~ 

+ (/ - pe')M n (t) -{I- pe')R [ Z n (s) ds, 

Jo 

X n (t) = X n (0) + E n (t) + X n t + e'M n (t) - e'R f Z n {s) ds 

Jo 

-d( [\x n {s)) + ds) - a f {X n (s)) + ds. 



Recall that Z n (t) = Z n (t) - 717. We then have 

Z n (t) = {I- pe')Z n {0) + $°(B n (t)) - p(X n (t))- 

+ (/ - pe')M n (t) -(I- pe')R I Z n {s) ds, 

Jo 

X n (t) = X n (0) + E n (t) + (A™ - nn)t + e'M n (t) - e'R I Z n (s) ds 

Jo 

— Cffj (X n (s)) + ds \ - a J\x n {s)) + ds, 

where we have used (3.3) and (3.4) in the derivations. Setting 
U n {t) = X n (0) + E n (t) + (A n - n/i)t 

(4.9) 

+ jM n {t)-6(J (X n (s)) + ds\ 

(4.10) V n {t) = {I- pe')Z n {0) + 3>°(B n (t)) + pe')M n (t) 
for t > 0, we finally have 

(4.11) X n (t) = U n (t) - a [\x n (s)) + ds - e'R f Z n {s) ds, 

Jo Jo 

(4.12) Z n (t) = V n (t) - p{X n (t))- -(I- pe')R f Z n (s) ds. 

Jo 
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By Lemma 1, we have obtained the following representation for the state 
processes: 

(4.13) (X n ,Z n ) = $(U n ,V n ). 

5. Proofs for G/Ph/n + M queues. This section provides proofs for 
Theorem 4 and a special version of Theorem 1 when the patience time 
distribution is exponential. Section 5.1 first establishes a fluid limit, which is 
needed in applying the random-time-change theorem to prove the theorems 
in Section 5.2. 

5.1. Fluid limits. For t > 0, define the fluid-scaled processes B n , D n , E n , 
f n , X n and Z n via 

B n (t) = -B n (t), D n (t) = -D n (t), E n (t) = -E n (t) : 
n n n 

f n (t) = -T n {t), X n (t) = -X n (t), Z n (t) = -Z n (t). 
n n n 

Theorem 5. Consider a sequence of G/Ph/n + M queues satisfying 
(2.1) and (2.2). Assume (3.29) holds. Then 

(5.1) {B n , D n , E n , T n , X n , Z n ) => (B, D, E, f, X, Z) asn^oo, 

where B(t) = fit, D{t) = fit, E(t) = Xt, f{t) = jt, X(t) = q and Z{t) = 7 for 
t>0. 

PROOF. For t > 0, let 

M n (t) = -M n (t), U n {t) = -U n (t), 

(5.2) 

V n (t) = -V n (t), L n {t) = -Z n (t). 
n n 

By (4.13) and the positively homogeneous property of <£, we have 

(X n ,L n ) = $(U n ,V n ). 

Setting 

(5.3) U(t) = q + (X-n)t and V(t) = for t > 0, 
one can check that <&(??, V") = (X, 0). We are going to show that 

(5.4) {M n ,U n ,V n )^(0,U,0) asra^oo. 

Assuming (5.4), we now complete the proof of the theorem. The continuity 
of the map $ implies that 

(X n ,L n ) = <S>(U n ,V n ) => $(U,V) = (X,0) asn^oo. 
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Since Z n (t) = L n (t) + 7 for t > 0, then Z n => ^ as n — > 00, from which one 
has T n =>• f as n -)■ 00. By (4.8), 



D n (t) = -e'M n (t) + e'R [ Z n (s)ds. 

Jo 



Since e'i? J Q Z(s)ds = [it for i > 0, by the continuous-mapping theorem 
Z) n l) as n — 7- 00. The convergence of -D n and (4.7) imply that B n B 
as n — )• 00, and I? satisfies 

e'Z(t) = e'Z(O) + B{t) - D{t) for t > 0. 

Since Z(t) = Z(0) = 7, we conclude that B{t) = fit for t > 0. By assumptions 
(2.1) and (2.2), for each T > 0, 

(5.5) - sup |£ ,n (t)|=^0 asn-^oo, 
re 0<t<T 

which implies that E n E. This proves the theorem when (5.4) holds. 

It remains to prove (5.4). By the functional strong law of large numbers 
(FSLLN), 

- sup |G(ni)|=>0, - sup |S(ni)|=>0, 
n o<t<T n 0<t<T 

(5.6) 

- sup \$ k (lnt\)\ ^0 

™ 0<<<T 

asn^oo, for k = 0, . . . , if. Let S n (i) = 5 n (nt) /n for t > 0. The FSLLN also 
leads to S n S as n — > 00 where S'(t) = z^t for t > 0. This, together with 

(4.6) , (5.6) and the fact T"(i) < t for t > 0, implies that 

(5.7) sup |M n (i)| =>0 asn-^oo. 

0<t<T 

Note that B n (t) < (X™(0))+ + E n (t). By (3.7) and the convergence of E n , 
the sequence of processes {B n ,n G N} is stochastically bounded, that is, for 
each T > 0, 



lim limsupl 



sup 5 n (t) > a 
0<t<T 



0. 



Using this and (5.6), we deduce that 

(5.8) sup -$°(5 n (i))^0 as n — > 00. 

o<t<T " 

Condition (3.7) implies that Z n (0)/n =4> as n — > 00, which, together with 
(5.7) and (5.8), leads to V n => V as n -)• 00. Since sup < t < r (X ?1 (t)) + < 
(X n (0)) + + E n (T), one can argue similarly that U n ^> U as n — > 00. Hence, 
we have shown (5.4) holds and thus proved the theorem. □ 
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5.2. Diffusion limits. In this section, we prove Theorems 1 and 4, under 
the assumption that the patience times are exponentially distributed. 
Define the diffusion-scaled processes G n , S n and <£> , . . . , & K via 

G n {t) = ^=G{nt), S n {t) = ^=S{nt), $ k > n (t) = -L$ k ([nt\) 
\ n Jn Jn 



for t > and k = 0, . . . , K. By the FCLT, one has 

(G n , S n , 4>°' n , . . . , $ K ' n ) =>- (G, S, $°, . . . , <I A ') as n oo, 

where G is a one-dimensional driftless Brownian motion, and S" and . . . , & K 
are i^-dimensional driftless Brownian motions. As mentioned previously, 
the variance of G is a, the covariance matrix for S is diag(z^), and for 
k = 0, . . . ,K, the covariance matrix for is H k given by (3.8). By the FCLT 
assumption (2.2) for the arrival process E n , the initial condition (3.29), and 
the independence assumption (4.1), one has 

(X n {0),Z n {0),E n ,G n , S n , i>°' n , . . . , $ K ' n ) 

(5.9) 

^(X(0),Z(0),E,G,S,$°,...,<S> K ) 

as n — > oo. The components of (E, G, S, $°, . . . , & K ) are mutually indepen- 
dent, and they are independent of (X(0), Z(0)). 

Let U n (t) = U n {t) — nU(t), and define the diffusion-scaled processes 

U n {t) = ^=U n {t) and V n {t) = -^=V n (t) for t > 0, 

Jn Jn 



where U is defined in (5.3). We now have the following lemma. 

Lemma 5. Consider a sequence ofG/Ph/n + M queues satisfying (2.1) 
and (2.2). Assume that (3.29) holds. Then 

(U n ,V n )^(U,V) asn^oo, 

where (U,V) is a (K + 1)- dimensional Brownian motion defined by (3. 
and (3.10). 

Proof. By (4.9) and (4.10), 

U n (t) = X n {0) + E n {t) + Jli(-X n - A J 

(5.10) 

+ e'M n {t) - G n (J (X n (s)) + ds^j , 
(5.11) V n (t) = (I- pe')Z n (0) + $°' n (B n (t)) + pe')M n {t), 
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where 

1 * 

M n {t) = —M n (t) = $ k,n {S%(f£(t))) -(I- P')S n {f n {t)) 

and S n (t) = S{nt)/n for t > 0. By the FSLLN, S n S as oo, where 
S(t) = vt for t > 0. The lemma now follows from (5.9), Theorem 5, the 
continuous-mapping theorem and the random-time-change theorem. □ 

Proof OF Theorem 1 (Assuming an exponential patience time distribu- 
tion). Since p = l, it follows that q = and A = /i. Then U(t) = for t > 0. 
It follows from the state-process representation (4.13) and the positively 
homogeneous property of the map $ that 

(X n ,Z n ) = <Z>{U n ,V n ). 

The theorem now follows from Lemma 5 and the continuous-mapping theo- 
rem. □ 

Although condition (3.16) is not explicitly required in the above proof, it 
can be deduced by using initial condition (3.7) and the assumption that the 
patience times of all customers, including those in queue initially, are i.i.d. 
following an exponential distribution. 

Before proving Theorem 4, we first establish a lemma. It says that when 
p > 1 , the number of idle servers goes to zero under diffusion scaling. 

Lemma 6. Let I n (t) = (X n (t))~ and I n {t) = I n (t)/y/n for t > 0. Then 
under the conditions of Theorem 4, 

J n =^0 as n— too. 
PROOF. It follows from (4.11) and (4.12) that 

-^=X n (t) = V^U(t) + U n (t)-^= l\x n {s)) + ds-e'R Z n (s)ds, 
V n V n Jo Jo 

Z n (t) = V n (t) - -V-(X n (t))- -{I- pe')R [ Z n {s) ds. 



m Jo 

Therefore, by Lemma 1 



-Lx n ,zA = $(U n + v^t?, V r 
Vn J 



By the Lipschitz continuity property (3.14) of the map $, for any T > 0, 
there exists a constant C\ > such that 

sup MU n + Vn~U,V n )(t)-<S>(^/nU,0)(t)\ <C^ sup {\U n (t)\ + \V n (t)\} 

0<t<T 0<t<T 
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for all n and all sample paths. One can check that <&(y/nU,Q) = (^ynq,0). 
Therefore, 

(5.12) inf -Lx»(t)>VZq-C$. sup {|*7 n (i)| + |^(i)|}. 

0<t<T -y/n 0<t<T 



By Lemma 5, 



lim limsupP 



sup {\U n (t)\ + \V n (t)\}>a 

0<t<T 



0, 



which, together with (5.12), implies that swp 0<t<T I n (t) => as n— > oo. The 
lemma is proved. □ 

Proof of Theorem 4. It follows from (4.11) and (4.12) that 

X n (t) = U n (t)--^= [\x n (s))~ds-a I X n {s)ds-e'R I Z n (s)ds, 
V« Jo Jo Jo 

Z n {t) = V n (t) - -E=(X n {t))- -{I- pe')R t Z n {s) ds. 
V n Jo 

Let 

8 n {t) = ^= {X n {s))-ds and e n {t) = ^=(X n (t))' for t > 0. 
V n Jo V n 

By Lemma 4, 

(X n , Z n ) = ^{U n - 5 n , V n - e n ). 

By Lemma 6, 

(5.13) {5 n ,e n ) => (0,0) asn^oo. 

The theorem follows from Lemma 5, (5.13) and the continuity of the map 
*. □ 

6. Proofs for critically loaded G / Ph/n + GI queues. In this section, 
we prove Theorem 1 for a general patience time distribution. Consider a 
sequence of G/Ph/n + GI queues indexed by n. Our starting point is the 
perturbed system described in Section 4.1 with the following modification: 
each customer in queue can abandon the system, not just the leading cus- 
tomer; when a customer's waiting time in the real FIFO queue exceeds his 
patience time, the customer abandons the system. By the same argument 
as in Section 4.1, for each n, the modified perturbed system is equivalent 
in distribution to the original G/ Ph/n + GI queue. In particular, the sys- 
tem equations (4.3)-(4.5) derived in Section 4.2 hold, except that (4.4) is 
modified as follows: 

(6.1) X n {t) = X n {0) + E n {t) - D n {t) - A n (t), 
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where A n {t) denotes the cumulative number of customers that have aban- 
doned the system by time t. We call A n = {A n (t),t > 0} the abandonment- 
count process in the nth system. 

With systems equations (4.3), (6.1) and (4.5), one can derive representa- 
tion (4.13) 

(X n ,Z n ) = <f>(U n ,V n ) 

with U n modified as 

U n (t) = X n (0) + E n (t) + (A n - nn)t + e'M n (t) 

(6.2) 

-A n {t) + a [ (X n (s)) + ds. 
Jo 

The derivation is identical to the one in Section 4.3 and is not repeated here. 

Before we prove Theorem 1, we state two lemmas, which will be proved 
at the end of this section. The first lemma follows from a main result in 
Dai and He (2010). In the lemma, the diffusion-scaled abandonment-count 
process A n = {A n (t),t> 0} is defined by 

(6.3) A n {t) = -^=A n {t) fort>0. 



Lemma 7. Under the conditions of Theorem 1, for any T > 0, 



(6.4) 



sup 

0<t<T 



A n {t)-a / (X n (s)) + ds 



as n — > oo. 



The next lemma is a generalization of the fluid limit theorem (Theorem 
5) to general patience time distributions, but with the restriction that p = l. 



Lemma 8. Under the conditions of Theorem 1, 

(6.5) (B r \D n ,E n ,f n ,X n ,Z n )^(B,D,E,f,X,Z) asn^oo, 

where B n , D n , E n , T n , X n and Z n are fluid-scaled processes defined at the 
beginning of Section 5.1, and B{t) = fit, D(t) = fit, E(t) = Xt, T(t) = ^t, 
X(t) = and Z{t) = 7 for t > 0. 



Proof of Theorem 1. Using the representation (4.13) with U n given 
by (6.2), one has 



(X n ,Z n ) = ${U n ,V n ), 
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where 

U n (t) = X n (0) + E n (t) + ^(-X n -\\+ e'M n (t) 

(6.6) 

A n {t)-a / {X n {s)) + ds 



o 

and V n is given by (5.11). By Lemma 1, the map <]? is continuous. Thus, to 
prove the theorem, it suffices to prove that 

(6.7) (U n ,V n )^(U,V), 

where (U, V) is the (K + l)-dimensional Brownian motion defined by (3.9) 
and (3.10). The convergence (6.7) follows from the proof of Lemma 5 with 
the following two modifications. First, the last term of U n in (6.6) is 

A n {t)-a j\x n {s)) + ds 

instead of 

G n (j\x n {s)) + ds 

in (5.10). We apply Lemma 7 to conclude that the last term in (6.6) converges 
to zero in distribution. Second, we use (6.5) instead of (5.1) in order to apply 
the random-time-change theorem to finish the proof of (6.7). □ 

Remark. It follows immediately from Theorem 1 and Lemma 7 that 
under the conditions of Theorem 1, 

(6.8) A 11 => A asn-^oo, 
where 



A(t) = a [ (X(s)) + ds fort>0. 
J o 



Proof of Lemma 7. We use Theorem 2.1 of Dai and He (2010) to prove 
the lemma. In order to apply the theorem, we need only verify that the se- 
quence of diffusion-scaled queue-length processes is stochastically bounded, 
that is, for any T > 0, 



(6.9) lim limsupP 



n— >oo 



sup -^(X n (t)) + >a 
0<t<T V n 



0. 



Theorem 2.2 of Dai and He (2010) states a comparison result: the queue 
length at any time in a G/G/n + G queue is dominated by the queue length 
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in the corresponding G/G/n queue without abandonment. Thus, (6.9) is 
implied by the stochastic boundedness of the sequence of diffusion-scaled 
queue-length processes in the corresponding G/Ph/n queues. Examining 
the proof of Theorem 1 in Section 5.2 for an exponential patience time 
distribution with rate a > 0, one concludes that Theorem 1 holds for the 
corresponding G/Ph/n queues without abandonment by setting a = 0. As a 
consequence, the sequence of diffusion-scaled queue-length processes in the 
G/Ph/n queues is stochastically bounded. □ 

Proof of Lemma 8. The proof of the lemma follows the proof of The- 
orem 5 with the following two modifications. First, U in (5.3) becomes zero 
in the current case because p=l. Second, U n has the representation (6.6) 
instead of (4.9). In Theorem 5, to prove U n =>- as n — > oo we used the fact 

— sup |G(nt)|=^0 as n — > oo, 

n 0<t<T 

which is proved in (5.6). Here, we need 

1 r* 

- sup A n (t)-a I {X n (s)) + ds =>• as n -> oo, 

n 0<t<T JO 

which holds because of Lemma 7. □ 

APPENDIX A: A CONTINUOUS MAP 

Let K G N be a fixed positive integer. Given functions h\ :]R ft:+1 — > M, 
h 2 : R K+1 -> R K and g : R -> R K , we wish to define a map T : B K+1 -> B K+1 . 
For each y = (yi,y 2 ) G ^ K+1 with yi (t) G R and y 2 (*) G R^ for t > 0, T(y) 
is defined to be any x = (xi,x 2 ) G B K+1 with xi(t) G R and x 2 (i) G R K for 
i > that satisfies 

(A.l) x 1 (t)=y x {t)+ ! h^xis^ds, 

Jo 

(A.2) x 2 (t) = y 2 (t)+ [ h 2 (x(s))ds + g( Xl (t)) 

Jo 

for t > 0. We assume that /ii, /i 2 and g are Lipschitz continuous. For a 
function / : R d — > R m with d, m G N, it is said to be Lipschitz continuous 
with Lipschitz constant c> if 

\f(u)-f(v)\<c\u-v\ for n, -y G R d 

(recall that |ti| = maxi<fc<^|tifc| denotes the maximum norm of u). The func- 
tion / is said to be positively homogeneous if 



f(au) = af(u) for any a > and u G 
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Given d G N, x G D d and T > 0, set \\x\\t = sup 0<t<T |x(t)| . 

The following lemma establishes the existence and the continuity of the 
map T. 

Lemma 9. Assume that hi, h 2 and g are Lipschitz continuous, (a) For 
each y = (yi,y 2 ) G ^> K+1 with yi(t) G R and y 2 (*) G R^ /or t > 0, i/iere ex- 
ists a unique x = (xi,X2) G J} K+1 with x\(t) G R and ^(t) G R"^ for t > 
i/iai satisfies (A.l) and (A. 2). (b) TTie map T:D^ +1 ->D jftr+1 zs Lipschitz 
continuous in the sense that for each T > 0, there exists a constant Ct > 
such that 

\\T(y) - T(y)\\ T < C T \\y - y\\ T for any y,ye B K+1 . 

(c) The map T is continuous when the domain O k+1 and the range ~B K+1 
are both endowed with the Skorohod J\-topology. (d) If, in addition, hi, h 2 
and g are assumed to be positively homogeneous, then the map T is positively 
homogeneous in the sense that 

T(ay) = aT(y) for each a > and each y G H) K+l . 

Proof. Assume that hi, h<i and g are Lipschitz continuous with Lip- 
schitz constant c> 0. Let y = (1/1,2/2) £ H) K+l be given. Let T > be fixed 
for the moment. Define x° = y and for each n G Z+, let x n+1 = (x^ +1 ,x 2 +1 ) 
be defined via 

rt 

n+l, 



*i (t)=m(t)+ / hi(x n (s))ds, 
Jo 

x n 2 + \t) = y 2 {t)+ [ t h 2 (x n (s))ds + g(x^ 1 (t)) 



for t G [0,T]. Setting 



X( n )(t) = ||x n+1 -x n || 4 , 



because 

xl -h 2 (x n -\s)))d, 

hi(x n (s))ds 



+ 9(vi(t) + J 



ds 



for i G [0,T], one has 

X {n+1 \t) < (c + c 2 ) f X^ n \s)ds for t€ M . 
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Then, by Lemma 11.3 in Mandelbaum, Massey and Reiman (1998), 

rpn 

X n+1 (t)<(c + c 2 )— sup X^(s) forte [0,21. 
n\ o< s <t 

Therefore, similarly to (11.22) in Mandelbaum, Massey and Reiman (1998), 
{x n ,n G N} is a Cauchy sequence under the uniform norm || • ||^. Since 
(B([0,T],lR i< ' +1 ), || • \\t) is a complete metric space (being a closed subset 
of the Banach space of bounded functions defined from [0,T] into 
and endowed with the uniform norm), {x n ,n G N} has a limit x that is 
in D([0,r],M A ' +1 ). One can check that x satisfies (A.l) and (A.2) for t G 
[0,T]. This proves the existence of the map T from D([0, T], M. K+1 ) to 
O([0,T],R K+1 ). 

Now we prove that the map from D([0, T],M A ' +1 ) to BQC^T],]^ 4 - 1 ) is 
Lipschitz continuous with respect to the uniform norm. Assume that y,y e 
O([0, T], R K+l ). Let T(y) be any solution x such that x and y satisfy (A.l) 
and (A.2) on [0,T]. Similarly, let T(y) be any solution associated with y. 
Setting x = (xi, £2) = T(y) and x, = {x\,x-i) = T(y), then for any t G [0, T], 

Mt) - xi (t) | < \y(t) - y(t)\ + c f |T(y)(s) - T(y)( S )\ ds, 

Jo 

\x 2 (t) - x 2 (t)\ < {1 + c)\y{t) - y(t)\ 

+ (c + c 2 ) /V(?/)(s)-T(y)( S )|(fe. 
Jo 

Hence, 

|T(y)(t)-T(y)(t)| 

< (1 + c)\y(t) - y(t)\ + (c + c 2 ) / |T(y)( S ) - T(y)(s)| rf S 

Jo 

forte [0,T]. 

By Corollary 11.2 in Mandelbaum, Massey and Reiman (1998) 

||T(y)-T(y)|| r <(l + C )||y-y|| T exp(( C + C 2 )T). 

Hence, T is Lipschitz continuous, which implies part (b) of the lemma. The 
Lipschitz continuity of T as a map from D([0, T], R K+1 ) to B([0,T\,R K+x ) 
shows that it is well defined on [0,T]. Since T > is arbitrary, T as a map 
from D A ' +1 to B K+1 is well defined. This proves part (a) of the lemma. 

Next we prove the continuity of T provided that I$ K+1 is endowed with 
the Skorohod J\ -topology [see, e.g., Section 3 of Whitt (2002)]. Consider 
a sequence {y n ,n G N} and y in B A+1 such that y n — >■ y as n — > oo. Let 
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x n = (a^Xg) = '^{y n ) an d x = (^1 ? ^2) = T(y). Note that since x € H) K+1 
there exists M > such that 



(A.3) 



IT 



T 



<M. 



Let A be the set of strictly increasing functions A : M + — > R + with A(0) = 0, 
lim^oo X(t) = 00, and 



7(A) = sup 

0<s<t 



log 



A(t) - A(s) 



i-s 



< 00. 



Since y n — )• ?/ as n — )• 00 in the J\ -topology on O k+1 , it follows from Propo- 
sition 3.5.3 of Ethier and Kurtz (1986) that there exists a sequence {X n ,n £ 
N}cA such that 



(A.4) 

and for each T > 
(A.5) 



lim 7 (A n ) = 0, 



lim \\y n (-)-y(X n (- 



0. 



For each A n E A, X n (t) is Lipschitz continuous in t. Hence, it is differentiable 
almost everywhere in t with respect to the Lebesgue measure. Furthermore, 
it follows from (3.5.5) of Ethier and Kurtz (1986) that when A n is differential 
at time t, its derivative X n (t) satisfies 

(A.6) \X n {t) -1| < 7 (A n ). 

Note that, for i = 1,2 

(A.7) / h l {x(s))ds = / hi(x{X n (s)))X n (s)ds. 

Jo Jo 

By (A.l) and (A.7) 



Xl (X n (t))= yi (X n (t)) + 



hi(x(s)) ds 



(Ai 



yi(X n (t))+ / h 1 (x(X n (s)))X n (s)ds 



yi(X n it))+ / h 1 (x(X n (s)))d s 



h 1 {x(X n (s)))(l-X n (s))ds. 



MANY-SERVER DIFFUSION LIMITS 31 
Similarly, by (A.2) and (A. 7) 

x 2 (X n (t))=y 2 (X n (t)) + fh^xi^is^ds 

Jo 

(A.9) - [' h 2 (x(X n (s)))(l-X n (s))ds 

Jo 

+ g(x 1 (X n (t))). 

By (A.l) and (A.8) 

|x?(f)-si(A»(t))| 

< \y?(t) - yi(X n (t))\ + f Mx n (s)) - M*(A n (*)))| ds 

Jo 

t \h 1 (x(X n (s)))-h 1 (0)\\l-X n (s)\ds 



(A.10) 



o 

+ I 1^(0)111 -X n (s)\ds 
Jo 



< \y n (t) - y(X n (t))\ + c f \x n {s) - x(X n (s))\ ds 

Jo 

+ c \x(X n {s))\\l - X n (s)\ds + 1^(0)1 /'|l-A n (s)|ds 



By (A.2), (A.9) and (A.10) 
\x n 2 {t)-x 2 (X n {t))\ 

<\y n 2 (t)-y 2 (X n {t))\ 



+ f\h 2 (x n (s))-h 2 (x{X n {s)))\ds 
Jo 

+ \g(xUt))-g(x 1 (X n (t)))\ 

\h 2 (x(X n (s)))-h 2 (0)\\l-X n (s)\ds 



\h 2 (0)\\l-X n (s)\ds 
< \y n (t) - y(X n (t))\ + c f \x n {s) - x(X n (s))\ ds 



+ c\x^(t)-x 1 (X n (t))\+c / \x(X n (s))\\l-X n (s)\ds 
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(A.ll) +M0)| / 4 |1-A n (s)|ds 

ft 

„2\ 







<(l + c)\y n (t)-y(X n (t))\ + (c + c z ) / \x n (s) - x(X n (s))\ds 

Jo 

+ (c + c 2 ) /V(A n (s))||l-A n (s)|(i S 



+ (|/i 2 (0)|+c|/» 1 (0)|) / |l-A n (s)|<fe 

JO 

Then (A.10) and (A.ll) yield 

|T(„»)(t)-T(y)(A"(t))| 

<(l + c)|^(t)-y(A"(t))| 
/■t 



(A.12) +{c + c') \T(y n )(s)ds-T(y)(X n (s))\ds 

Jo 

+ (c + c 2 ) r|l-A"( S )||T(y)(A-( S ))|d S 



+ (|M0)|+c|M0)|) / |l-A n (s)|d S . 







It follows from (A. 3), (A. 4), (A. 6) and the dominated convergence theorem 
that 

(A.13) / |l-A n (s)||T(y)(A n (s))|ds^O asn^oo. 



o 



Given 5 > 0, by (A. 4), (A. 6) and (A.13), for n large enough 
(c + c 2 ) [ T \l-X n (s)\\T(y)(\ n is))\ds 



and by (A.5) 



/ T |l-A"( S )|d S <J 



(l + c)||y»O-y(A»(0)|| T <£. 
By Corollary 11.2 in Mandelbaum, Massey and Reiman (1998) and (A.12) 

||T(y»)(.) - T(y)(A»(-))||T < 5exp((c + c 2 )T) 
for large enough n. Thus, for each T > 0, 

Iim||T( ! ,»)(.)-T(y)(A B (0)||T = O. 

n— >oo 
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Hence, T(y n ) — > T(y) as n — > oo in D +1 in the J\ -topology. This implies 
part (c) of the lemma. 

To prove part (d) of the lemma, for y £ O k+1 , assume that x and y 
satisfy (A.l) and (A. 2). Then, for a > 0, one can check that ax and ay also 
satisfy (A.l) and (A. 2) because of the positive homogeneity of hi, hi and g. 
Therefore, T (ay) = aT(y). □ 

APPENDIX B: PROOFS OF LEMMAS 2, 3 AND THEOREM 3 

This section is devoted to proving Lemmas 2, 3 and Theorem 3. We first 
present two lemmas. 

The first lemma is an immediate result by Proposition 4.4 of Dai and He 
(2010). It proves that the virtual waiting time processes (see the paragraph 
prior to Theorem 3 for the definition) converge to zero in distribution as 
n — > oo. 

Lemma 10. Under the conditions of Theorem 1, 

W n => as n -too. 

For t > 0, let 

(B.l) C(t) = mf{s>0:s + W n (s) >t}. 

Since s + W n (s) < t for all s < C n (0> each customer arriving before time 
C n (t) cannot be waiting in queue at time t [see Lemmas 3.2 and 3.3 of Dai 
and He (2010) for a detailed explanation]; similarly, since s + W n (s) > t for 
all s > C n (^)> a customer who arrives after time C n (^) cannot be in service 
at t. So C n (t) is a crucial epoch with respect to the queue length at time t. 
The next lemma concerns the process Q n = {Q n (t),t > 0}. 

Lemma 11. Under the conditions of Theorem 1, (J 1 G D is nondecreasing 
for each n E N, and 

C n =^C asn^-oo, 
where £(i) =t for t > is the identity function on R + . 

Proof. First note that 

(B.2) C{t) + W n (C{t))>t fort>0, 

because W n is right-continuous. 

Next, we prove that £ n is nondecreasing in t. Suppose, on the contrary, 
that for some < s < t, we have C n (^) < C n ( s )- This implies by (B.l) that 
for any ( n (t) <u< ( n (s), 

t<u + W n (u) < s, 
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leading to a contradiction. 

Now we prove that (J 1 6 D, that is, £ n is right-continuous on [0, oo) and 
has left limits on (0,oo). Since C n (^) 5; t by (B.l) and Q n is nondecreasing, 
C n (t—) exists for each t > 0; therefore, (J 1 has left limits on (0,oo). To prove 
right-continuity, fix e > and t > 0. We have 

C n (i) + e + W n (C(i) + e) > t + 5 for some 6 > 0, 

so that C n (t + 5') < C n (* + 5) < C n (*) + e for < 5' < 5. Hence, C™ is right- 
continuous at £, proving ("GD. 

Finally, we prove the convergence. By (B.2) and the fact C"(i) < f° r an Y 
T>0, 

sup |t-C(i)|< sup W n {( n (t))< sup W n (t). 

0<i<T 0<t<T 0<t<T 

Then £ n => £ as n — >• oo follows from Lemma 10. □ 

Proof of Lemma 2. Fix T > 0, and restrict f G [0,T]. Since each cus- 
tomer arriving before time C n (0 W1U either have entered service or aban- 
doned the system by time t, we have {X n {t)) + < E n {t) - E n {( n (t)) + A n (( n (t)) 
where A n (t) = E n (t) — E n {t—) is the number of customers who arrive (ex- 
actly) at time t. Because C n (*) < t by (B.l), we have 

sup A n (( n {t)) < \\A n \\ T 

0<t<T 

for ||A n ||*r = sup 0<t<T A n (i); thus, 

(B.3) {X n {t)) + < E n {t) - E n (C(t)) + ||A n '|| T . 

Similarly, because a customer who arrives during (( n (t),t] will either be 
waiting in queue at time t or has abandoned the system by t, one has 

(B.4) {X n (t)) + > E n (t) - E n {C(t)) - (A n (t) - A n (C(t))). 

Let £ N} be a sequence of i.i.d. ^-dimensional random vectors 

such that for k = 1, . . . ,K, the probability that = e k is pk', it is used to 
indicate the initial service phase of each customer (see the first paragraph 
in Section 4.2). Write 

JV 

^°(N) = ^V°00 and ¥°(JV) = *°(JV) - pN. 

i=l 

Because the customers who arrive before time £ n (i) cannot be waiting in 
queue at time t (they have either abandoned the system or started service) , 
for k = 1, . . . , K , 

Q n k (t) < *°((X"(0))+ + E n (t)) - *°((X"(0))+ + E n (C{t)) - || A"|| T ) 
(B.5) = ££((X n (0)) + + E n {t)) - *£((X n (0))+ + E n (C{t)) - \\A n \\ T ) 
+ p k (E n (t)-E n (C(t)) + \\A n \\ T ). 
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Similarly, the customers who arrive during (£ n (t),i] cannot get into service 
by time t. Then 

Q n {t) + {A n {t) _ A n {Cm 

> ¥°((X»(0))+ + E n (t)) - *g((JT»(0))+ + E n (C(t))) 

(B-6) 

= ttg((X*(0)) + + E n (t)) - *g((J^(0))+ + ^(C n (*))) 

+ Pfc (^(t)-£^(C ,l (<))). 
Combining (B.3)-(B.6), we have 

(B.7) Aftt) <<ft(t) -p*(jr(t))+<D5(t), 

where 

Ajj(t) = *° fc ((x n (o))+ + - ^(pr"(o))+ + ^(C"(t))) 

_(A"(t)-A»(C"(t)))-p fc ||A»|| r , 

Dj(t) = *° fc (pT n (0))+ + E n (t)) - *° fc ((X"(0))+ + ^"(C n (t)) - || A»|| T ) 

+ p fc (||A n || T + A"(t)-A n (C n (t))). 

Let §°' n (t) = *°(LntJ)/V^, ||A™|| r = ||A n i| T /V^, and || A n || T = || A n || T /n. 
Rewriting (B.7) using diffusion scaling one has 

(b.8) im < -±=(Qm-Pk(x n (t)) + ) < nm, 

V n 

where 

A£(t) = tf°' n ((X"(0)) + + E»(t)) - §°' n ((A >n (0)) + + £"(<*(*))) 

_(I".( t )_A«(C"(t)))-^||A"|| r , 
n™(t) = £J"((X"(0))+ + - §°' n ((X™(0))+ + £ n (c n W) - ||Alr) 

+ p k (\\A n \\ T + A n (t)-A n (C(t)))- 

Next, we show that A£ =>■ and =^0 as n — > oo, which, together with 
(B.8), will lead to (3.20). Using (2.2), we have 

(B.9) ||A n || T ^0 asrwoo. 

Lemma 11, (6.8), Theorem 3.9 in Billingsley (1999) and the random-time- 
change theorem [see the lemma on page 151 of Billingsley (1999)] yield 

(B.10) sup \A n (t) -A n (C(t))\ ^0 asn^oo. 

0<t<T 

By Theorem 5, Lemma 11 and the random-time-change theorem, 
(B.ll) (X n (0))+^0 and E n (( n (-))^E asn^oo. 



3G 



J. G. DAI, S. HE AND T. TEZCAN 



Since \& 0,n => ^° where is a .fT-dimensional Brownian motion, by Theo- 
rem 5, (B.9), (B.ll) and the random-time-change theorem 



(B.12) 



(B.13) 



sup |^°' n ((X n (0))+ +E n (t)) 

0<t<T 



§°' n ((X n (0))+ + E n (( n (t)) - \\A n \\ T )\ 0, 
sup |^' n ((X n (0))+ +E n (t)) 

0<t<T 

-*° k ' n ((X n (0))+ + E n (e(t)))\^0 



as n ^ oo. We deduce from (B.9)-(B.13) that AJJ =>■ and n£ 

□ 



as n — > oo. 



Proof of Theorem 3. Since all customers arriving prior to time t > 
will have either got into service or abandoned the system by time t + W n (t) 
[see Lemmas 3.2 and 3.3 of Dai and He (2010)], then 

(X n (t + W n {t))) + < E n {t + W n {t)) - E n (t). 

For a customer who arrives during (t, t + W n (i)], he can possibly be waiting 
in queue at time t + W n (t), or have abandoned the system by t + W n (t), or 
starts his service (exactly) at t + W n (t). Therefore, 

E n (t + W n (t)) - E n (t) < (X n (t + W n {t))) + + A n (t + W n {t)) - A n {t) 

+ A n D (t + W n (t)), 

where A^(t) = D n {t) — D n {t—) is the number of service completions (ex- 
actly) at time t. Then by (2.3) and (6.3), 

< -^=\ n W n {t) - (X n (t + W n {t))) + + E n (t + W n {t)) - E n (t) 



n 



< A n (t + W n {t)) - A n {t) + A n D {t + W n (t)), 
where A n D (t) = A£(t)/v/n. This leads to 

\fiy/EW n (t) - (X n (t + W n (t))) + \ 



(B.14) 



< 



V^[-x n -fi)w n (t) 



n 



+ \E n {t + W n (t))-E n (t)\ 



+ \A n (t + W n {t)) - A n (t)\ + A n D (t + W n {t)). 

Next we show that all terms on the right-hand side of (B.14) converge 
weakly to zero as n — > oo. Using (2.1) and Lemma 10, we get 



(B.15) 







as n — > oo. 
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For any T > 0, by (2.2) and Lemma 10, 

(B.16) sup \E n (t + W n (t)) -E n (t)\ ^0 asn^oo. 

0<t<T 

By (6.8) and Lemma 10, 

(B.17) sup \A n (t + W n (t))-A n (t)\^0 asrwoo. 

0<t<T 

Set D n (t) = (D n (t)-nfit)/^n. It follows from (4.8) that D n =>- D as n -»• oo, 
where 

D(t) = -e'M(t) + e'R [ Z(s)ds. 

Jo 

Since D is continuous almost surely, using Lemma 10 again, we have 
(B.18) sup \k n D {t + W n (t))\ ^0 asn^oo. 

0<t<T 

Combining (B.14)-(B.18), we deduce that 

(B.19) sup \nV^W n {t) - (X n (t + W n (t))) + \ ^0 asra^oo. 

0<t<T 

By (3.12), the process X is continuous almost surely; then so is (X) + . 
Because s + W n (s) < t + W n {t) for < s < t [see Lemma 3.3 of Dai and He 
(2010)] and the process {X) + is continuous almost surely, by Lemma 10 and 
the random-time-change theorem, 

(B.20) (X n {- + W n {-))) + ^{X) + as/woo. 

By (B.19), (B.20) and the convergence-together theorem [see Theorem 3.1 
of Billingsley (1999)], ^hW n (X) + /fi as n -> oo. □ 

Proof of Lemma 3. Recall that any customer who is waiting in queue 
at time t > must arrive at the system during [£"(£), t] [see (B.l) and the 
discussion therein], and must leave the queue (either goes into service or 
abandons the system) by time t + W n (t) [see Lemmas 3.2 and 3.3 of Dai 
and He (2010)]. This implies 

A^it) < A n (t + W n (t)) - A n (C(t)-). 

It follows that for any T > 0, 

sup A n Q (t)< sup \A n (t + W n (t))-A n (C(t))\ 

0<t<T 0<t<T 

+ sup \A n (C(t))-A n (C(t)-)\. 

0<t<T 
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By (B.10) and (B.17) 

sup \A n (t + W n (t))-A n (C(t))\^0 asn^oo. 

0<t<T 

By (6.8) and the fact ( n (t) < t 

sup \A n (C(t))-A n (C(t)-)\ 

0<t<T 

< sup \A n {t) -A n {t-)\ =^0 asn^oo. 

0<t<T 

Hence, (3.25) holds. □ 
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